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The  2002-3  pandemic  caused  by  severe  acute  respiratory  syndrome 
coronavirus  (SARS-CoV)  was  one  of  the  most  significant  public  health 
events  in  recent  history1.  An  ongoing  outbreak  of  Middle  East  respira¬ 
tory  syndrome  coronavirus2 *  suggests  that  this  group  of  viruses  remains 
a  key  threat  and  that  their  distribution  is  wider  than  previously  recog¬ 
nized.  Although  bats  have  been  suggested  to  be  the  natural  reservoirs 
of  both  viruses3-5,  attempts  to  isolate  the  progenitor  virus  of  SARS- 
CoV  from  bats  have  been  unsuccessful.  Diverse  SARS-like  corona- 
viruses  (SL-CoVs)  have  now  been  reported  from  bats  in  China, 
Europe  and  Africa5-8,  but  none  is  considered  a  direct  progenitor 
of  SARS-CoV  because  of  their  phylogenetic  disparity  from  this  virus 
and  the  inability  of  their  spike  proteins  to  use  the  SARS-CoV  cellular 
receptor  molecule,  the  human  angiotensin  converting  enzyme  II 
(ACE2)9,10.  Here  we  report  whole-genome  sequences  of  two  novel  bat 
coronaviruses  from  Chinese  horseshoe  bats  (family:  Rhinolophidae) 
in  Yunnan,  China:  RsSHC014  and  Rs3367.  These  viruses  are  far  more 
closely  related  to  SARS-CoV  than  any  previously  identified  bat  coro¬ 
naviruses,  particularly  in  the  receptor  binding  domain  of  the  spike 
protein.  Most  importantly,  we  report  the  first  recorded  isolation  of 
a  live  SL-CoV  (bat  SL-CoV-WIVl)  from  bat  faecal  samples  in  Vero 
E6  cells,  which  has  typical  coronavirus  morphology,  99.9%  sequence 
identity  to  Rs3367  and  uses  ACE2  from  humans,  civets  and  Chinese 
horseshoe  bats  for  cell  entry.  Preliminary  in  vitro  testing  indicates 
that  WI V 1  also  has  a  broad  species  tropism.  Our  results  provide  the 
strongest  evidence  to  date  that  Chinese  horseshoe  bats  are  natural 
reservoirs  of  SARS-CoV,  and  that  intermediate  hosts  may  not  be 
necessary  for  direct  human  infection  by  some  bat  SL-CoV s.  They  also 
highlight  the  importance  of  pathogen-discovery  programs  targeting 
high-risk  wildlife  groups  in  emerging  disease  hotspots  as  a  strategy 
for  pandemic  preparedness. 

The  2002-3  pandemic  of  SARS1  and  the  ongoing  emergence  of  the 
Middle  East  respiratory  syndrome  coronavirus  (MERS-CoV)2  demon¬ 
strate  that  Co  Vs  are  a  significant  public  health  threat.  SARS-CoV  was 
shown  to  use  the  human  ACE2  molecule  as  its  entry  receptor,  and  this 
is  considered  a  hallmark  of  its  cross-species  transmissibility11.  The  receptor 
binding  domain  (RBD)  located  in  the  amino -terminal  region  (amino 
acids  318-510)  of  the  SARS-CoV  spike  (S)  protein  is  directly  involved 
in  binding  to  ACE2  (ref.  12).  However,  despite  phylogenetic  evidence 
that  SARS-CoV  evolved  from  bat  SL-CoVs,  all  previously  identified 
SL-CoVs  have  major  sequence  differences  from  SARS-CoV  in  the  RBD 
of  their  S  proteins,  including  one  or  two  deletions6,9.  Replacing  the  RBD 
of  one  SL-CoV  S  protein  with  SARS-CoV  S  conferred  the  ability  to  use 
human  ACE2  and  replicate  efficiently  in  mice9,13.  However,  to  date,  no 
SL-CoVs  have  been  isolated  from  bats,  and  no  wild-type  SL-CoV  of  bat 
origin  has  been  shown  to  use  ACE2. 

We  conducted  a  12 -month  longitudinal  survey  (April  201 1 -September 
2012)  of  SL-CoVs  in  a  colony  of  Rhinolophus  sinicus  at  a  single  location 


in  Kunming,  Yunnan  Province,  China  (Extended  Data  Table  1).  A  total 
of  1 17  anal  swabs  or  faecal  samples  were  collected  from  individual  bats 
using  a  previously  published  method5,14.  A  one-step  reverse  transcrip¬ 
tion  (RT) -nested  PCR  was  conducted  to  amplify  the  RNA-dependent 
RNA  polymerase  (RdRP)  motifs  A  and  C,  which  are  conserved  among 
alphacoronaviruses  and  betacoronaviruses15. 

Twenty-seven  of  the  117  samples  (23%)  were  classed  as  positive  by 
PCR  and  subsequently  confirmed  by  sequencing.  The  species  origin  of 
all  positive  samples  was  confirmed  to  be  R.  sinicus  by  cytochrome  b 
sequence  analysis,  as  described  previously16.  A  higher  prevalence  was 
observed  in  samples  collected  in  October  (30%  in  2011  and  48.7%  in 
2012)  than  those  in  April  (7.1%  in  201 1)  or  May  (7.4%  in  2012)  (Extended 
Data  Table  1).  Analysis  of  the  S  protein  RBD  sequences  indicated  the 
presence  of  seven  different  strains  of  SL-CoVs  (Fig.  la  and  Extended 
Data  Figs  1  and  2).  In  addition  to  RBD  sequences,  which  closely  matched 
previously  described  SL-CoVs  (Rs672,  Rfl  and  HKU3)5,8,17,18,  two  novel 
strains  (designated  SL-CoV  RsSHC014  and  Rs3367)  were  discovered. 
Their  full-length  genome  sequences  were  determined,  and  both  were 
found  to  be  29,787  base  pairs  in  size  (excluding  the  poly(A)  tail).  The 
overall  nucleotide  sequence  identity  of  these  two  genomes  with  human 
SARS-CoV  (Tor2  strain)  is  95%,  higher  than  that  observed  previously 
for  bat  SL-CoVs  in  China  (88-92%)5,8, 17,18  or  Europe  (76%)6  (Extended 
Data  Table  2  and  Extended  Data  Figs  3  and  4).  Higher  sequence  iden¬ 
tities  were  observed  at  the  protein  level  between  these  new  SL-CoVs 
and  SARS-CoVs  (Extended  Data  Tables  3  and  4).  To  understand  the 
evolutionary  origin  of  these  two  novel  SL-CoV  strains,  we  conducted 
recombination  analysis  with  the  Recombination  Detection  Program 
4.0  package19  using  available  genome  sequences  of  bat  SL-CoV  strains 
(Rfl,  Rp3,  Rs672,  Rml,  HKU3  and  BM48-31)  and  human  and  civet 
representative  SARS-CoV  strains  (BJ01,  SZ3,  Tor2  and  GZ02).  Three 
breakpoints  were  detected  with  strong  P  values  (<10_2°)  and  supported 
by  similarity  plot  andbootscan  analysis  (Extended  Data  Fig.  5a,  b).  Break¬ 
points  were  located  at  nucleotides  20,827,  26,553  and  28,685  in  the 
Rs3367  (and  RsSHC014)  genome,  and  generated  recombination  frag¬ 
ments  covering  nucleotides  20,827-26,533  (5,727  nucleotides)  (inclu¬ 
ding  partial  open  reading  frame  (ORF)  lb,  full-length  S,  ORF3,  E  and 
partial  M  gene)  and  nucleotides  26,534-28,685  (2,133  nucleotides) 
(including  partial  ORF  M,  full-length  ORF6,  ORF7,  ORF8  and  partial 
N  gene).  Phylogenetic  analysis  using  the  major  and  minor  parental  regions 
suggested  that  Rs3367,  or  RsSHCO  14,  is  the  descendent  of  a  recombination 
of  lineages  that  ultimately  lead  to  SARS-CoV  and  SL-CoV  Rs672  (Fig.  lb). 

The  most  notable  sequence  differences  between  these  two  new  SL- 
CoVs  and  previously  identified  SL-CoVs  is  in  the  RBD  regions  of  their 
S  proteins.  First,  they  have  higher  amino  acid  sequence  identity  to  SARS- 
CoV  (85%  and  96%  for  RsSHCO  14  and  Rs3367,  respectively).  Second, 
there  are  no  deletions  and  they  have  perfect  sequence  alignment  with 
the  SARS-CoV  RBD  region  (Extended  Data  Figs  1  and  2).  Structural 
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Figure  1  |  Phylogenetic  tree  based  on  amino  acid  sequences  of  the  S  RBD 
region  and  the  two  parental  regions  of  bat  SL-CoV  Rs3367  or  RsSHC014. 

a,  SARS-CoV  S  protein  amino  acid  residues  310-520  were  aligned  with 
homologous  regions  of  bat  SL-CoVs  using  the  ClustalW  software.  A  maximum- 
likelihood  phylogenetic  tree  was  constructed  using  a  Poisson  model  with 
bootstrap  values  determined  by  1,000  replicates  in  the  MEGA5  software  package. 
The  RBD  sequences  identified  in  this  study  are  in  bold  and  named  by  the  sample 
numbers.  The  key  amino  acid  residues  involved  in  interacting  with  the  human 
ACE2  molecule  are  indicated  on  the  right  of  the  tree.  SARS-CoV  GZ02,  BJ01  and 
Tor2  were  isolated  from  patients  in  the  early,  middle  and  late  phase,  respectively, 
of  the  SARS  outbreak  in  2003.  SARS-CoV  SZ3  was  identified  from  Paguma 
larvata  in  2003  collected  in  Guangdong,  China.  SL-CoV  Rp3,  Rs672  and  HKU3-1 
were  identified  from  R.  sinicus  collected  in  China  (respectively:  Guangxi,  2004; 
Guizhou,  2006;  Hong  Kong,  2005).  Rfl  and  Rml  were  identified  from 


R.  ferrumequinum  and  R.  macrotis,  respectively,  collected  in  Hubei,  China,  in 
2004.  Bat  SARS-related  CoV  BM48-31  was  identified  from  R.  blasii  collected  in 
Bulgaria  in  2008.  Bat  CoV  HKU9-1  was  identified  from  Rousettus  leschenaultii 
collected  in  Guangdong,  China  in  2005/2006  and  used  as  an  outgroup.  All 
sequences  in  bold  and  italics  were  identified  in  the  current  study.  Filled  triangles, 
circles  and  diamonds  indicate  samples  with  co- infection  by  two  different 
SL-CoVs.  indicates  the  amino  acid  deletion,  b,  Phylogenetic  origins  of  the  two 

parental  regions  of  Rs3367  or  RsSHCOI 4.  Maximum  likelihood  phylogenetic 
trees  were  constructed  from  alignments  of  two  fragments  covering  nucleotides 
20,827-26,533  (5,727  nucleotides)  and  26,534  -28,685  (2,133  nucleotides)  of  the 
Rs3367  genome,  respectively.  For  display  purposes,  the  trees  were  midpoint 
rooted.  The  taxa  were  annotated  according  to  strain  names:  SARS-CoV,  SARS 
coronavirus;  SARS-like  CoV,  bat  SARS-like  coronavirus.  The  two  novel  SL-CoVs, 
Rs3367  and  RsSHC014,  are  in  bold  and  italics. 


and  mutagenesis  studies  have  previously  identified  five  key  residues 
(amino  acids  442, 472, 479, 487  and  491)  in  the  RBD  of  the  SARS-CoV 
S  protein  that  have  a  pivotal  role  in  receptor  binding20,21.  Although  all 
five  residues  in  the  RsSHC014  S  protein  were  found  to  be  different 
from  those  of  SARS-CoV,  two  of  the  five  residues  in  the  Rs3367  RBD 
were  conserved  (Fig.  1  and  Extended  Data  Fig.  1). 

Despite  the  rapid  accumulation  of  bat  CoV  sequences  in  the  last 
decade,  there  has  been  no  report  of  successful  virus  isolation6,22,23.  We 
attempted  isolation  from  SL-CoV  PCR-positive  samples.  Using  an 
optimized  protocol  and  Vero  E6  cells,  we  obtained  one  isolate  which 
caused  cytopathic  effect  during  the  second  blind  passage.  Purified  virions 
displayed  typical  coronavirus  morphology  under  electron  microscopy 
(Fig.  2).  Sequence  analysis  using  a  sequence-independent  amplifica¬ 
tion  method14  to  avoid  PCR- introduced  contamination  indicated  that 
the  isolate  was  almost  identical  to  Rs3367,  with  99.9%  nucleotide  genome 
sequence  identity  and  100%  amino  acid  sequence  identity  for  the  SI 
region.  The  new  isolate  was  named  SL-CoV-WIVl. 


To  determine  whether  WIV 1  can  use  ACE2  as  a  cellular  entry  receptor, 
we  conducted  virus  infectivity  studies  using  HeLa  cells  expressing  or 
not  expressing  ACE2  from  humans,  civets  or  Chinese  horseshoe  bats. 
We  found  that  WIV  1  is  able  to  use  ACE2  of  different  origins  as  an  entry 
receptor  and  replicated  efficiently  in  the  ACE2- expressing  cells  (Fig.  3). 
This  is,  to  our  knowledge,  the  first  identification  of  a  wild-type  bat  SL- 
CoV  capable  of  using  ACE2  as  an  entry  receptor. 

To  assess  its  cross-species  transmission  potential,  we  conducted  infec¬ 
tivity  assays  in  cell  lines  from  a  range  of  species.  Our  results  (Fig.  4  and 
Extended  Data  Table  5)  indicate  that  bat  SL-CoV- WIV  1  can  grow  in 
human  alveolar  basal  epithelial  (A549),  pig  kidney  15  (PK-15)  and 
Rhinolophus  sinicus  kidney  (RSKT)  cell  lines,  but  not  in  human  cervix 
(HeLa),  Syrian  golden  hamster  kidney  (BHK2 1),  Myotis  davidii  kidney 
(BK),  Myotis  chinensis  kidney  (MCKT),  Rousettus  leschenaulti  kidney 
(RLK)  or  Pteropus  alecto  kidney  (PaKi)  cell  lines.  Real-time  RT-PCR 
indicated  that  WIV1  replicated  much  less  efficiently  in  A549,  PK-15 
and  RSKT  cells  than  in  Vero  E6  cells  (Fig.  4). 
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Figure  2  |  Electron  micrograph  of  purified  virions.  Virions  from  a  10-ml 
culture  were  collected,  fixed  and  concentrated/purified  by  sucrose  gradient 
centrifugation.  The  pelleted  viral  particles  were  suspended  in  100  pi  PBS, 
stained  with  2%  phosphotungstic  acid  (pH  7.0)  and  examined  directly  using  a 
Tecnai  transmission  electron  microscope  (FEI)  at  200  kV. 

To  assess  the  cross-neutralization  activity  of  human  SARS-CoV  sera 
against  WIV1,  we  conducted  serum-neutralization  assays  using  nine 
convalescent  sera  from  SARS  patients  collected  in  2003.  The  results 
showed  that  seven  of  these  were  able  to  completely  neutralize  100  tissue 


culture  infectious  dose  50  (TCID50)  WIV1  at  dilutions  of  1:10  to  1:40, 
further  confirming  the  close  relationship  between  WIV 1  and  SARS-CoV. 

Our  findings  have  important  implications  for  public  health.  First, 
they  provide  the  clearest  evidence  yet  that  SARS-CoV  originated  in  bats. 
Our  previous  work  provided  phylogenetic  evidence  of  this5,  but  the  lack 
of  an  isolate  or  evidence  that  bat  SL-CoVs  can  naturally  infect  human 
cells,  until  now,  had  cast  doubt  on  this  hypothesis.  Second,  the  lack  of 
capacity  of  SL-CoVs  to  use  of  ACE2  receptors  has  previously  been 
considered  as  the  key  barrier  for  their  direct  spillover  into  humans,  suppor¬ 
ting  the  suggestion  that  civets  were  intermediate  hosts  for  SARS-CoV 
adaptation  to  human  transmission  during  the  SARS  outbreak24.  However, 
the  ability  of  SL-CoV-WIVl  to  use  human  ACE2  argues  against  the 
necessity  of  this  step  for  SL-CoV-WIVl  and  suggests  that  direct  bat- 
to-human  infection  is  a  plausible  scenario  for  some  bat  SL-CoVs.  This 
has  implications  for  public  health  control  measures  in  the  face  of  poten¬ 
tial  spillover  of  a  diverse  and  growing  pool  of  recently  discovered  SARS- 
like  Co  Vs  with  a  wide  geographic  distribution. 

Our  findings  suggest  that  the  diversity  of  bat  Co  Vs  is  substantially 
higher  than  that  previously  reported.  In  this  study  we  were  able  to  demon¬ 
strate  the  circulation  of  at  least  seven  different  strains  of  SL-CoVs  within  a 
single  colony  of  R.  sinicus  during  a  12-month  period.  The  high  genetic 
diversity  of  SL-CoVs  within  this  colony  was  mirrored  by  high  pheno¬ 
typic  diversity  in  the  differential  use  of  ACE2  by  different  strains.  It 
would  therefore  not  be  surprising  if  further  surveillance  reveals  a  broad 
diversity  of  bat  SL-CoVs  that  are  able  to  use  ACE2,  some  of  which  may 
have  even  closer  homology  to  SARS-CoV  than  SL-CoV-WIVl.  Our 
results — in  addition  to  the  recent  demonstration  of  MERS-CoV  in  a 
Saudi  Arabian  bat25,  and  of  bat  CoVs  closely  related  to  MERS-CoV  in 
China,  Africa,  Europe  and  North  America3,26,27 — suggest  that  bat  coro- 
naviruses  remain  a  substantial  global  threat  to  public  health. 

Finally,  this  study  demonstrates  the  public  health  importance  of  path¬ 
ogen  discovery  programs  targeting  wildlife  that  aim  to  identify  the  ‘known 
unknowns’ — previously  unknown  viral  strains  closely  related  to  known 
pathogens.  These  programs,  focused  on  specific  high-risk  wildlife  groups 
and  hotspots  of  disease  emergence,  may  be  a  critical  part  of  future  global 
strategies  to  predict,  prepare  for,  and  prevent  pandemic  emergence28. 
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Figure  3  |  Analysis  of  receptor  usage  of  SL-CoV-WIVl  determined  by 
immunofluorescence  assay  and  real-time  PCR.  Determination  of  vims 
infectivity  in  HeLa  cells  with  and  without  the  expression  of  ACE2.  b,  bat; 
c,  civet;  h,  human.  ACE2  expression  was  detected  with  goat  anti-humanACE2 
antibody  followed  by  fluorescein  isothiocyanate  (FITC) -conjugated  donkey 
anti-goat  IgG.  Virus  replication  was  detected  with  rabbit  antibody  against  the 


SL-CoV  Rp3  nucleocapsid  protein  followed  by  cyanine  3  (Cy3)- conjugated 
mouse  anti-rabbit  IgG.  Nuclei  were  stained  with  DAPI  (4',6-diamidino-2- 
phenylindole).  The  columns  (from  left  to  right)  show  staining  of  nuclei  (blue), 
ACE2  expression  (green),  virus  replication  (red),  merged  triple- stained 
images  and  real-time  PCR  results,  respectively.  ( n  —  3);  error  bars  represent 
standard  deviation. 
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Figure  4  |  Analysis  of  host  range  of  SL-CoV-WIVl  determined  by 
immunofluorescence  assay  and  real-time  PCR.  Virus  infection  in  A549, 
RSKT,  Vero  E6  and  PK- 15  cells.  Virus  replication  was  detected  as  described  for 
Fig.  3.  The  columns  (from  left  to  right)  show  staining  of  nuclei  (blue),  virus 
replication  (red),  merged  double- stained  images  and  real-time  PCR  results, 
respectively,  n  —  3;  error  bars  represent  s.d. 

METHODS  SUMMARY 

Throat  and  faecal  swabs  or  fresh  faecal  samples  were  collected  in  viral  transport 
medium  as  described  previously14.  All  PCR  was  conducted  with  the  One-Step  RT- 
PCR  kit  (Invitrogen).  Primers  targeting  the  highly  conserved  regions  of  the  RdRP 
gene  were  used  for  detection  of  all  alphacoronaviruses  and  betacoronaviruses  as 
described  previously15.  Degenerate  primers  were  designed  on  the  basis  of  all  avail¬ 
able  genomic  sequences  of  SARS-CoVs  and  SL-CoVs  and  used  for  amplification  of 
the  RBD  sequences  of  S  genes  or  full-length  genomic  sequences.  Degenerate  primers 
were  used  for  amplification  of  the  bat  ACE2  gene  as  described  previously29.  PCR 
products  were  gel  purified  and  cloned  into  pGEM-T  Easy  Vector  (Promega).  At 
least  four  independent  clones  were  sequenced  to  obtain  a  consensus  sequence.  PCR- 
positive  faecal  samples  (in  200  pi  buffer)  were  gradient  centrifuged  at  3,000-  12,000g 
and  supernatant  diluted  at  1 : 1 0  in  DMEM  before  being  added  to  Vero  E6  cells.  After 
incubation  at  37  °C  for  1  h,  inocula  were  removed  and  replaced  with  fresh  DMEM 
with  2%  FCS.  Cells  were  incubated  at  37  °C  and  checked  daily  for  cytopathic  effect. 
Cell  lines  from  different  origins  were  grown  on  coverslips  in  24-well  plates  and 
inoculated  with  the  novel  SL-CoV  at  a  multiplicity  of  infection  of  10.  Virus  repli¬ 
cation  was  detected  at  24  h  after  infection  using  rabbit  antibodies  against  the  SL- 
CoV  Rp3  nucleocapsid  protein  followed  by  Cy3-conjugated  goat  anti-rabbit  IgG. 

Online  Content  Any  additional  Methods,  Extended  Data  display  items  and  Source 
Data  are  available  in  the  online  version  of  the  paper;  references  unique  to  these 
sections  appear  only  in  the  online  paper. 
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METHODS 

Sampling.  Bats  were  trapped  in  their  natural  habitat  as  described  previously5. 
Throat  and  faecal  swab  samples  were  collected  in  viral  transport  medium  (VTM) 
composed  of  Hank’s  balanced  salt  solution,  pH  7.4,  containing  BSA  (1%),  ampho¬ 
tericin  (15  pgml-1),  penicillin  G  (100  U  ml-1)  and  streptomycin  (50  pgml-1).  To 
collect  fresh  faecal  samples,  clean  plastic  sheets  measuring  2.0  by  2.0  m  were  placed 
under  known  bat  roosting  sites  at  about  18:00  h  each  evening.  Relatively  fresh  faecal 
samples  were  collected  from  sheets  at  approximately  05:30-06:00  the  next  morning 
and  placed  in  VTM.  Samples  were  transported  to  the  laboratory  and  stored  at 
—  80  °C  until  use.  All  animals  trapped  for  this  study  were  released  back  to  their 
habitat  after  sample  collection.  All  sampling  processes  were  performed  by  veter¬ 
inarians  with  approval  from  Animal  Ethics  Committee  of  the  Wuhan  Institute  of 
Virology  (WIVH052 10201)  and  EcoHealth  Alliance  under  an  inter-institutional 
agreement  with  University  of  California,  Davis  (UC  Davis  protocol  no.  16048). 
RNA  extraction,  PCR  and  sequencing.  RNA  was  extracted  from  140  pi  of  swab 
or  faecal  samples  with  a  Viral  RNA  Mini  Kit  (Qiagen)  following  the  manufacturer’s 
instructions.  RNA  was  eluted  in  60  pi  RNAse-free  buffer  (buffer  AVE,  Qiagen), 
then  aliquoted  and  stored  at  —80  °C.  One- step  RT-PCR  (Invitrogen)  was  used  to 
detect  coronavirus  sequences  as  described  previously15.  First  round  PCR  was  con¬ 
ducted  in  a  25-pl  reaction  mix  containing  12.5  pi  PCR  2X  reaction  mix  buffer, 
10  pmol  of  each  primer,  2.5  mM  MgS04,  20  U  RNase  inhibitor,  1  pi  Superscript 
III/  Platinum  Taq  Enzyme  Mix  and  5  pi  RNA.  Amplification  of  the  RdRP-gene  frag¬ 
ment  was  performed  as  follows:  50  °C  for  30  min,  94  °C  for  2  min,  followed  by  40 
cycles  consisting  of  94  °C  for  15  s,  62  °C  for  15  s,  68  °C  for  40  s,  and  a  final  exten¬ 
sion  of  68  °C  for  5  min.  Second  round  PCR  was  conducted  in  a  25-pl  reaction  mix 
containing  2.5  pi  PCR  reaction  buffer,  5  pmol  of  each  primer,  50  mM  MgCl2, 
0.5  mM  dNTP,  0.1  pi  Platinum  Taq  Enzyme  (Invitrogen)  and  1  pi  first  round 
PCR  product.  The  amplification  of  RdRP-gene  fragment  was  performed  as  fol¬ 
lows:  94  °C  for  5  min  followed  by  35  cycles  consisting  of  94  °C  for  30  s,  52  °C  for 
30  s,  72  °C  for  40  s,  and  a  final  extension  of  72  °C  for  5  min. 

To  amplify  the  RBD  region,  one-step  RT-PCR  was  performed  with  primers 
designed  based  on  available  SARS-CoV  or  bat  SL-CoVs  (first  round  PCR  primers; 
F,  forward;  R,  reverse:  CoVS931F-5'-VWGADGTTGTKAGRTTYCCT-3'  and 
CoVS1909R-5'-TAARACAVCCWGCYTGWGT-3';  second  PCR  primers:  Co  VS 
951F-5'-TGTKAGRTTYCCTAAYATTAC-3'  and  CoVS1805R-5'-ACATCYTG 
ATANARAACAGC-3 ' ) .  First-round  PCR  was  conducted  in  a  25-pl  reaction  mix 
as  described  above  except  primers  specific  for  the  S  gene  were  used.  The  ampli¬ 
fication  of  the  RBD  region  of  the  S  gene  was  performed  as  follows:  50  °C  for  30  min, 
94  °C  for  2  min,  followed  by  35  cycles  consisting  of  94  °C  for  15  s,  43  °C  for  15  s, 
68  °C  for  90  s,  and  a  final  extension  of  68  °C  for  5  min.  Second-round  PCR  was 
conducted  in  a  25-pl  reaction  mix  containing  2.5  pi  PCR  reaction  buffer,  5  pmol  of 
each  primer,  50  mM  MgCl2, 0.5  mM  dNTP,  0.1  pi  Platinum  Taq  Enzyme  (Invitrogen) 
and  1  pi  first  round  PCR  product.  Amplification  was  performed  as  follows:  94  °C 
for  5  min  followed  by  40  cycles  consisting  of  94  °C  for  30  s,  41  °C  for  30  s,  72  °C  for 
60  s,  and  a  final  extension  of  72  °C  for  5  min. 

PCR  products  were  gel  purified  and  cloned  into  pGEM-T  Easy  Vector  (Promega). 
At  least  four  independent  clones  were  sequenced  to  obtain  a  consensus  sequence 
for  each  of  the  amplified  regions. 

Sequencing  full-length  genomes.  Degenerate  coronavirus  primers  were  designed 
based  on  all  available  SARS-CoV  and  bat  SL-CoV  sequences  in  GenBank  and  specific 
primers  were  designed  from  genome  sequences  generated  from  previous  rounds  of 
sequencing  in  this  study  (primer  sequences  will  be  provided  upon  request).  All 
PCRs  were  conducted  using  the  One-Step  RT-PCR  kit  (Invitrogen).  The  5'  and  3' 
genomic  ends  were  determined  using  the  5'  or  3'  RACE  kit  (Roche),  respectively. 
PCR  products  were  gel  purified  and  sequenced  directly  or  following  cloning  into 
pGEM-T  Easy  Vector  (Promega).  At  least  four  independent  clones  were  sequenced 
to  obtain  a  consensus  sequence  for  each  of  the  amplified  regions  and  each  region 
was  sequenced  at  least  twice. 

Sequence  analysis  and  databank  accession  numbers.  Routine  sequence  manage¬ 
ment  and  analysis  was  carried  out  using  DNAStar  or  Geneious.  Sequence  align¬ 
ment  and  editing  was  conducted  using  ClustalW,  BioEdit  or  GeneDoc.  Maximum 
Likelihood  phylogenetic  trees  based  on  the  protein  sequences  were  constructed 
using  a  Poisson  model  with  bootstrap  values  determined  by  1,000  replicates  in  the 
MEGA5  software  package. 

Sequences  obtained  in  this  study  have  been  deposited  in  GenBank  as  follows 
(accession  numbers  given  in  parenthesis):  full-length  genome  sequence  of  SL-CoV 
RsSHC014  and  Rs3367  (KC881005,  KC881006);  full-length  sequence  of  WIV1  S 
(KC881007);  RBD  (KC880984-KC881003);  ACE2  (KC8810040).  SARS-CoV 
sequences  used  in  this  study:  human  SARS-CoV  strains  Tor2  (AY274119),  BJ01 
(AY278488),  GZ02  (AY390556)  and  civet  SARS-CoV  strain  SZ3  (AY304486).  Bat 
coronavirus  sequences  used  in  this  study:  Rs672  (FJ588686),  Rp3  (DQ071615),  Rfl 
(DQ412042),  Rml  (DQ412043),  HKU3-1  (DQ022305),  BM48-31  (NC_014470), 
HKU9-1  (NC  009021),  HKU4  (NC  009019),  HKU5  (NC  009020),  HKU8  (DQ249228), 


HKU2  (EF203067),  BtCoV512  (NC_009657),  1A  (NC_010437).  Other  coronavirus 
sequences  used  in  this  study:  HCoV-229E  (AF304460),  HCoV-OC43  (AY391777), 
HCoV-NL63  (AY567487),  HKU1  (NC  006577),  EMC  (JX869059),  FIPV  (NC  002306), 
PRCV  (DQ811787),  BWCoV  (NC  010646),  MHV  (AY700211),  IBV  (AY851295). 
Amplification,  cloning  and  expression  of  the  bat  ACE2  gene.  Construction  of 
expression  clones  for  human  and  civet  ACE2  in  pcDNA3.1  has  been  described 
previously29.  Bat  ACE2  was  amplified  from  a  R.  sinicus  (sample  no.  3357).  In  brief, 
total  RNA  was  extracted  from  bat  rectal  tissue  using  the  RNeasy  Mini  Kit  (Qiagen). 
First-strand  complementary  DNA  was  synthesized  from  total  RNA  by  reverse  trans¬ 
cription  with  random  hexamers.  Full-length  bat  ACE2  fragments  were  amplified 
using  forward  primer  bAF2  and  reverse  primer  bAR2  (ref.  29).  The  ACE2  gene  was 
cloned  into  pCDNA3.1  with  Kpnl  and  Xhol,  and  verified  by  sequencing.  Purified 
ACE2  plasmids  were  transfected  to  HeLa  cells.  After  24  h,  lysates  of  HeLa  cells 
expressing  human,  civet,  or  bat  ACE2  were  confirmed  by  western  blot  or  immu¬ 
nofluorescence  assay. 

Western  blot  analysis.  Lysates  of  cells  or  filtered  supernatants  containing  pseu¬ 
doviruses  were  separated  by  SDS-PAGE,  followed  by  transfer  to  a  nitrocellulose 
membrane  (Millipore).  For  detection  of  S  protein,  the  membrane  was  incubated 
with  rabbit  anti-Rp3  S  fragment  (amino  acids  561-666)  polyantibodies  (1:200), 
and  the  bound  antibodies  were  detected  by  alkaline  phosphatase  (AP) -conjugated 
goat  anti-rabbit  IgG  (1:1,000).  For  detection  of  HIV- 1  p24  in  supernatants,  mono¬ 
clonal  antibody  against  HIV  p24  (p24  MAb)  was  used  as  the  primary  antibody  at  a 
dilution  of  1 : 1 ,000,  followed  by  incubation  with  AP-conjugated  goat  anti-mouse  IgG 
at  the  same  dilution.  To  detect  the  expression  of  ACE2  in  HeLa  cells,  goat  antibody 
against  the  human  ACE2  ectodomain  ( 1 :500)  was  used  as  the  first  antibody,  followed 
by  incubation  with  horseradish  peroxidase-conjugated  donkey  anti-goat  IgG  (1:1 ,000) . 
Virus  isolation.  Vero  E6  cell  monolayers  were  maintained  in  DMEM  supplemen¬ 
ted  with  10%  FCS.  PCR-positive  samples  (in  200  pi  buffer)  were  gradient  centri¬ 
fuged  at  3,000-1 2, 000g,  and  supernatant  were  diluted  1:10  in  DMEM  before  being 
added  to  Vero  E6  cells.  After  incubation  at  37  °C  for  1  h,  inocula  were  removed  and 
replaced  with  fresh  DMEM  with  2%  FCS.  Cells  were  incubated  at  37  °C  for  3  days 
and  checked  daily  for  cytopathic  effect.  Double-dose  triple  antibiotics  penicillin/ 
streptomycin/amphotericin  (Gibco)  were  included  in  all  tissue  culture  media  (peni¬ 
cillin  200IUml-1,  streptomycin  0.2  mg  ml-1,  amphotericin  0.5  pg  mf1).  Three 
blind  passages  were  carried  out  for  each  sample.  After  each  passage,  both  the  culture 
supernatant  and  cell  pellet  were  examined  for  presence  of  virus  by  RT-PCR  using 
primers  targeting  the  RdRP  or  S  gene.  Virions  in  supernatant  (10  ml)  were  collected 
and  fixed  using  0.1%  formaldehyde  for  4  h,  then  concentrated  by  ultracentrifuga¬ 
tion  through  a  20%  sucrose  cushion  (5  ml)  at  80,000g  for  90  min  using  a  Ty90  rotor 
(Beckman).  The  pelleted  viral  particles  were  suspended  in  100  pi  PBS,  stained  with 
2%  phosphotungstic  acid  (pH  7.0)  and  examined  using  a  Tecnai  transmission 
electron  microscope  (FEI)  at  200  kV. 

Virus  infectivity  detected  by  immunofluorescence  assay.  Cell  lines  used  for  this 
study  and  their  culture  conditions  are  summarized  in  Extended  Data  T able  5.  Virus 
titre  was  determined  in  Vero  E6  cells  by  cytopathic  effect  (CPE)  counts.  Cell  lines 
from  different  origins  and  HeLa  cells  expressing  ACE2  from  human,  civet  or  Chinese 
horseshoe  bat  were  grown  on  coverslips  in  24-well  plates  (Corning)  incubated  with 
bat  SL-CoV-WIVl  at  a  multiplicity  of  infection  =  10  for  1  h.  The  inoculum  was 
removed  and  washed  twice  with  PBS  and  supplemented  with  medium.  HeLa  cells 
without  ACE2  expression  and  Vero  E6  cells  were  used  as  negative  and  positive 
controls,  respectively.  At  24  h  after  infection,  cells  were  washed  with  PBS  and  fixed 
with  4%  formaldehyde  in  PBS  (pH  7.4)  for  20  min  at  4  °C.  ACE2  expression  was 
detected  using  goat  anti-human  ACE2  immunoglobulin  (R&D  Systems)  followed 
by  FITC-labelled  donkey  anti-goat  immunoglobulin  (PTGLab).  Virus  replication 
was  detected  using  rabbit  antibody  against  the  SL-CoV  Rp3  nucleocapsid  protein 
followed  by  Cy3 -conjugated  mouse  anti-rabbit  IgG.  Nuclei  were  stained  with  DAPI. 
Staining  patterns  were  examined  using  a  FV1200  confocal  microscope  (Olympus). 
Virus  infectivity  detected  by  real-time  RT-PCR.  Vero  E6,  A549,  PK15,  RSKT 
and  HeLa  cells  with  or  without  expression  of  ACE2  of  different  origins  were  inocu¬ 
lated  with  0.1  TCID50  WIV-1  and  incubated  for  1  h  at  37  °C.  After  removing  the 
inoculum,  the  cells  were  cultured  with  medium  containing  1%  FBS.  Supernatants 
were  collected  at  0,  12,  24  and  48  h.  RNA  from  140  pi  of  each  supernatant  was 
extracted  with  the  Viral  RNA  Mini  Kit  (Qiagen)  following  manufacturer’s  instruc¬ 
tions  and  eluted  in  60  pi  buffer  AVE  (Qiagen).  RNA  was  quantified  on  the  ABI 
StepOne  system,  with  the  TaqMan  AgPath-ID  One-Step  RT-PCR  Kit  (Applied 
Biosystems)  in  a  25  pi  reaction  mix  containing  4  pi  RNA,  1  X  RT-PCR  enzyme 
mix,  1  X  RT-PCR  buffer,  40  pmol  forward  primer  (5'-GTGGTGGTGACGGCA 
AAATG-3'),  40 pmol  reverse  primer  (5'-AAGTGAAGCTTCTGGGCCAG-3') 
and  12  pmol  probe  (5 '  -FAM- AAAGAGCTCAGCCCCAGATG-BHQ 1 -3 ' ) .  Ampli¬ 
fication  parameters  were  10  min  at  50  °C,  10  min  at  95  °C  and  50  cycles  of  15  s  at  95  °C 
and  20  s  at  60  °C.  RNA  dilutions  from  purified  WIV-1  stock  were  used  as  a  standard. 
Serum  neutralization  test.  SARS  patient  sera  were  inactivated  at  56  °C  for  30  min 
and  then  used  for  virus  neutralization  testing.  Sera  were  diluted  starting  with  1:10 
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and  then  serially  twofold  diluted  in  96-well  cell  plates  to  1:40.  Each  100  pi  serum 
dilution  was  mixed  with  100  pi  viral  supernatant  containing  100  TCID50  of  WIV 1 
and  incubated  at  37  °C  for  1  h.  The  mixture  was  added  in  triplicate  wells  of  96-well 
cell  plates  with  plated  monolayers  of  Vero  E6  cells  and  further  incubated  at  37  °C 
for  2  days.  Serum  from  a  healthy  blood  donor  was  used  as  a  negative  control  in 
each  experiment.  CPE  was  observed  using  an  inverted  microscope  2  days  after 
inoculation.  The  neutralizing  antibody  titre  was  read  as  the  highest  dilution  of 
serum  which  completely  suppressed  CPE  in  infected  wells.  The  neutralization  test 
was  repeated  twice. 

Recombination  analysis.  Full-length  genomic  sequences  of  SL-CoV  Rs3367  or 
RsSHC014  were  aligned  with  those  of  selected  SARS-CoVs  and  bat  SL-CoVs  using 
Clustal  X.  The  aligned  sequences  were  preliminarily  scanned  for  recombination 


events  using  Recombination  Detection  Program  (RDP)  4.0  (ref.  19).  The  potential 
recombination  events  suggested  by  RDP  owing  to  their  strong  Rvalues  (<  10-20) 
were  investigated  further  by  similarity  plot  and  bootscan  analyses  implemented  in 
Simplot  3.5.1.  Phylogenetic  origin  of  the  major  and  minor  parental  regions  of 
Rs3367  or  RsSHC014  were  constructed  from  the  concatenated  sequences  of  the 
essential  ORFs  of  the  major  and  minor  parental  regions  of  selected  SARS-Co  V  and 
SL-CoVs.  Two  genome  regions  between  three  estimated  breakpoints  (20,827- 
26,553  and  26,554-28,685)  were  aligned  independently  using  ClustalX  and  gene¬ 
rated  two  alignments  of  5,727  base  pairs  and  2,133  base  pairs.  The  two  alignments 
were  used  to  construct  maximum  likelihood  trees  to  better  infer  the  fragment 
parents.  All  nucleotide  numberings  in  this  study  are  based  on  Rs3367  genome 
position. 
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Extended  Data  Figure  1  |  Sequence  alignment  of  CoV  S  protein  RBD. 

SARS-CoV  S  protein  (amino  acids  310-520)  is  aligned  with  homologous 
regions  of  bat  SL-CoVs  using  ClustalW.  The  newly  discovered  bat  SL-CoVs  are 


indicated  with  a  bold  vertical  line  on  the  left.  The  key  amino  acid  residues 
involved  in  the  interaction  with  human  ACE2  are  numbered  on  the  top  of  the 
aligned  sequences. 
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Extended  Data  Figure  2  |  Alignment  of  CoV  S  protein  SI  sequences. 

Alignment  of  SI  sequences  (amino  acids  1-660)  of  the  two  novel  bat  SL-CoV  S 
proteins  with  those  of  previously  reported  bat  SL-CoVs  and  human  and 
civet  SARS-CoVs.  The  newly  discovered  bat  SL-CoVs  are  boxed  in  red. 
SARS-CoV  GZ02,  BJ01  and  Tor2  were  isolated  from  patients  in  the  early, 
middle  and  late  phase,  respectively,  of  the  SARS  outbreak  in  2003.  SARS-CoV 


SZ3  was  identified  from  P.  larvata  in  2003  collected  in  Guangdong,  China. 
SL-CoV  Rp3,  Rs  672  and  HKU3-1  were  identified  from  R.  sinicus  collected  in 
Guangxi,  Guizhou  and  Hong  Kong,  China,  respectively.  Rfl  and  Rml  were 
identified  from  R.  ferrumequinum  and  R.  macrotis,  respectively,  collected  in 
Hubei  Province,  China.  Bat  SARS-related  CoV  BM48-31  was  identified  from 
R.  blasii  collected  in  Bulgaria. 
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Extended  Data  Figure  3  |  Complete  RdRP  sequence  phylogeny. 

Phylogenetic  tree  of  bat  SL-CoVs  and  SARS-CoVs  on  the  basis  of  complete 
RdRP  sequences  (2,796  nucleotides).  Bat  SL-CoVs  RsSHC014  and  Rs3367  are 
highlighted  by  filled  circles.  Three  established  coronaivirus  genera, 
Alphacoronavirus,  Betacoronavirus  and  Gammacoronavirus  are  marked  as  a,  (3 


and  y,  respectively.  Four  CoV  groups  in  the  genus  Betacoronavirus  are 
indicated  as  A,  B,  C  and  D,  respectively.  MHV,  murine  hepatitis  virus; 
PHEV,  porcine  haemagglutinating  encephalomyelitis  virus;  PRCV,  porcine 
respiratory  coronavirus;  FIPV,  feline  infectious  peritonitis  virus;  IBV, 
infectious  bronchitis  coronavirus;  BW,  beluga  whale  coronavirus. 


©2013  Macmillan  Publishers  Limited.  All  rights  reserved 


RESEARCH 


LETTER 


0.5 

Extended  Data  Figure  4  |  Sequence  phylogeny  of  the  complete  S  protein  of  Bat  SL-CoVs  RsSHC014  and  Rs3367  are  highlighted  by  filled  circles.  Bat  CoV 
SL-CoVs  and  SARS-CoV.  Phylogenetic  tree  of  bat  SL-CoVs  and  SARS-CoVs  HKU9  was  used  as  an  outgroup, 
on  the  basis  of  complete  S  protein  sequences  (1,256  amino  acids). 
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Extended  Data  Figure  5  |  Detection  of  potential  recombination  events. 

a,  b,  Similarity  plot  (a)  and  bootscan  analysis  (b)  detected  three  recombination 
breakpoints  in  the  bat  SL-CoV  Rs3367  or  SHC014  genome.  The  three 
breakpoints  were  located  at  the  ORFlb  (nt  20,827),  M  (nucleotides  26,553)  and 


N  (nucleotides  28,685)  genes,  respectively.  Both  analyses  were  performed  with 
an  F84  distance  model,  a  window  size  of  1,500  base  pairs  and  a  step  size  of 
300  base  pairs. 
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Extended  Data  Table  1  |  Summary  of  sampling  detail  and  CoV  prevalence 


Sampling  time 

Total  number  of  swab  or  fecal  samples 

collected 

Number  of  CoV  PCR  positive  samples  (%) 

April,  2011 

14 

1(7.1) 

October,  201 1 

10 

3(30) 

May,  2012 

54 

4  (7.4) 

September,  2012 

39 

19(48.7) 
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Extended  Data  Table  2  |  Genomic  sequence  identities  of  bat  SL-CoVs  with  SARS-CoVs 

Pairwise  genomic  nucleotide  acids  identity  (%) 


CoVs 

Genome 

size  (nt) 

Bat  SARS-Like  CoVs 

Human  and  civet 

SARS-CoVs 

SHC014 

Rs672 

Rp3 

Rfl 

Rml 

HKU3-1 

BM48-31 

GZ02 

BJ01 

Tor2 

SZ3 

3367 

29,787 

98.8 

92.5 

93.2 

87.3 

88.0 

87.8 

76.9 

95.4 

95.3 

95.4 

95.3 

SHC014 

29,787 

- 

92.6 

93.2 

87.3 

88.1 

87.8 

77.0 

95.2 

95.1 

95.1 

95.1 

Rs672 

29,059 

- 

92.4 

86.2 

87.4 

87.0 

75.2 

90.9 

90.9 

90.8 

91.0 

Rp3 

29,736 

- 

88.3 

90.3 

89.6 

77.0 

92.1 

92.0 

92.1 

92.0 

Rfl 

29,709 

- 

89.4 

88.4 

76.6 

87.2 

87.1 

87.2 

87.1 

Rml 

29,749 

- 

90.1 

76.4 

87.6 

87.5 

87.5 

87.5 

HKU3-1 

29,728 

- 

76.8 

87.4 

87.3 

87.4 

87.3 

BM48-31 

29,276 

- 

76.9 

77.1 

77.0 

76.9 

GZ02 

29,760 

- 

99.6 

99.6 

99.7 

BJ01 

29,725 

- 

99.8 

99.6 

Tor2 

29,751 

- 

99.5 

SZ3 

29,741 

- 
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Extended  Data  Table  3  |  Genomic  annotation  and  comparison  of  bat  SL-CoV  Rs3367  with  human/civet  SARS-CoVs  and  other  bat  SL-CoVs 


ORF  Identity  nt/aa  (%) 


Human  and  civet  SARS-CoVs 

Bat  SARS-like  CoVs 

ORFs 

Start-End  (nt.) 

No.  of 

Nt. 

No.  of 
Aa. 

TRS 

GZO 

2 

BJ01 

Tor2 

SZ3 

Rs672 

Rp3 

Rfl 

Rml 

HKU3-1 

BM48-31 

Pla 

265-13,398 

13,134 

4,377 

ACGAAC  AUG 

192 

96.7/97.9 

96.6/97.9 

96.8/97.9 

96.8/98.1 

93.3/94.2 

95.5/96.9 

88.1/94.0 

87.9/93.3 

87.9/94.2 

76.3/80.8 

Plb 

13,398-21,485 

8,088 

2,695 

96.3/99.2 

96.3/99.2 

96.3/99.2 

96.3/99.2 

97.2799.2 

97.2/99.2 

90.6/98.4 

91.0/98.7 

90.7/98.5 

83.4/93.7 

S 

21,492-25,262 

3,771 

1,256 

ACGAACAUG 

88.3/90.1 

88.2790.0 

88.1/89.8 

88.2/90.0 

76.5/78.2 

76.0/79.1 

74.0/77.4 

76.3/79.1 

75.6/78.2 

70.2/74.5 

(SI)* 

21,493-23,535 

2,043 

681 

78.2/81.1 

78.2780.9 

78.1/80.6 

78.2/81.1 

65.1/62.2 

63.9/63.0 

62.9/62.5 

64.7/63.3 

65.2/63.4 

62.2/64.7 

(S2)* 

23,536-25,263 

1,728 

575 

98.1/99.3 

98.1/99.3 

98.1/99.3 

98.1/99.1 

87.9/94.8 

88.1/95.8 

85.1/92.7 

87.9/95.4 

86/93.5 

76.6/88.2 

ORF3a 

25,271-26,095 

825 

274 

ACGAAC  AUG 

2 

99.2/98.1 

98.6/97.0 

98.7/97.0 

98.5/96.7 

90.4/90.8 

84.1/84.3 

88.8/86.8 

83.6/84.3 

83.1/82.4 

72.1/71.2 

ORF3b 

25,692-26,036 

345 

114 

99.1/99.1 

98.2798.2 

98.2/98.2 

97.9/97.3 

99.1/98.2 

N/D 

82.6/92.1 

N/D 

N/D 

N/D 

E 

26,120-26,350 

231 

76 

ACGAAC  AUG 

2 

98.7/98.6 

98.7/98.6 

98.7/98.6 

98.7/98.6 

99.1/98.6 

97.8/98.6 

96.5/96.0 

96.1/97.3 

97.4/98.6 

91.3/93.4 

M 

26,401-27,066 

666 

221 

ACGAAC  AUG 

97.4/98.1 

97.2798.1 

97.2/98.1 

97.2/97.7 

98.7/99.5 

93.3/98.1 

96.3/98.6 

93.2/95.4 

93.9/96.8 

78.5/88.1 

ORF6 

27,077-27,268 

192 

63 

ACGAAC  AUG 

156 

97.3/95.2 

96.8/93.6 

97.3/95.2 

97.3/95.2 

97.3/96.8 

95.8/92.0 

94.2/92.0 

95.3/92 

94.7/90.4 

63.5/49.2 

ORF7a 

27,276-27,644 

369 

122 

ACGAACAUG 

94.5/95.9 

94.5/95.9 

94.5/95.9 

94.5/95.9 

97.8/100 

96.2/99.1 

92.9/95.0 

93.4/97.5 

93.2/97.5 

62.3/58.1 

ORF7b 

27,641-27,776 

135 

44 

96.2/93.1 

96.2793.1 

96.2/93.1 

96.2/93.1 

99.2/100 

99.2/100 

97.7/97.7 

99.2/100 

93.3/95.4 

62.9/63.6 

ORF8 

27,782-28,147 

366 

121 

ACGAACAUG 

47.1/46.3 

N/A 

N/A 

47.1/46.3 

97.8/100 

85.2/90.2 

46.2/39.0 

85.7/90.2 

85.7/85.3 

N/A 

N 

28,162-29,430 

1,269 

422 

ACGAAC  AUG 

8 

98.3/99.5 

98.4/99.5 

98.4/99.5 

98.4/99.5 

98/98.5 

96.6/97.6 

93.7/95.2 

96.2/97.1 

95.9/96.2 

77.9/87.2 

s2m 

29,628-29,668 

41 

97.5 

97.5 

97.5 

97.5 

100 

100 

100 

100 

100 

95.1 

*S1,  the  N-terminal  domain  of  the  coronavirus  S  protein  responsible  for  receptor  binding.  S2,  the  S  protein  C-terminal  domain  responsible  for  membrane  fusion. 

The  ORFs  in  the  genome  were  predicted  and  potential  protein  sequences  were  translated.  The  pairwise  comparisons  were  conducted  for  all  ORFs  at  nucleotide  acids  (nt)  and  amino  acids  (aa)  levels.  The  s2m  were  compared  at  nt 
level.  TRS:  Transcription  regulating-sequences;  N/D,  not  done;  N/A,  not  available. 
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Extended  Data  Table  4  |  Genomic  annotation  and  comparison  of  bat  SL-CoV  RsSHC014  with  human/civet  SARS-CoVs  and  other  bat  SL-CoVs 


ORF  Identity  nt/aa  (%) 


Human  and  civet  SARS-CoVs 

Bat  SARS-like  CoVs 

ORFs 

Start-End  (nt.) 

No.  of 

Nt. 

No.  of 
Aa. 

TRS 

GZ0 

2 

BJ01 

Tor2 

SZ3 

Rs672 

Rp3 

Rfl 

Rml 

HKU3-1 

BM48-31 

Pla 

265-13,398 

13,134 

4,377 

ACGAAC  AUG 

192 

96.7/97.9 

96.6/97.9 

96.8/97.9 

96.8/98.1 

93.3/94.2 

95.5/96.9 

88.1/94.0 

87.9/93.3 

87.9/94.2 

76.3/80.8 

Plb 

13,398-21,485 

8,088 

2,695 

96.3/99.2 

96.3/99.2 

96.3/99.2 

96.3/99.2 

97.2/99.2 

97.2/99.2 

90.6/98.4 

91.0/98.7 

90.7/98.5 

83.4/93.7 

S 

21,492-25,262 

3,771 

1,256 

ACGAACAUG 

88.3/90.1 

88.2790.0 

88.1/89.8 

88.2/90.0 

76.5/78.2 

76.0/79.1 

74.0/77.4 

76.3/79.1 

75.6/78.2 

70.2/74.5 

(SI)* 

21,493-23,535 

2,043 

681 

78.2/81.1 

78.2780.9 

78.1/80.6 

78.2/81.1 

65.1/62.2 

63.9/63.0 

62.9/62.5 

64.7/63.3 

65.2/63.4 

62.2/64.7 

(S2)  * 

23,536-25,263 

1,728 

575 

98.1/99.3 

98.1/99.3 

98.1/99.3 

98.1/99.1 

87.9/94.8 

88.1/95.8 

85.1/92.7 

87.9/95.4 

86/93.5 

76.6/88.2 

ORF3a 

25,271-26,095 

825 

274 

ACGAAC  AUG 

2 

99.2/98.1 

98.6/97.0 

98.7/97.0 

98.5/96.7 

90.4/90.8 

84.1/84.3 

88.8/86.8 

83.6/84.3 

83.1/82.4 

72.1/71.2 

ORF3b 

25,692-26,036 

345 

114 

99.1/99.1 

98.2798.2 

98.2/98.2 

97.9/97.3 

99.1/98.2 

N/D 

82.6/92.1 

N/D 

N/D 

N/D 

E 

26,120-26,350 

231 

76 

ACGAAC  AUG 

2 

98.7/98.6 

98.7/98.6 

98.7/98.6 

98.7/98.6 

99.1/98.6 

97.8/98.6 

96.5/96.0 

96.1/97.3 

97.4/98.6 

91.3/93.4 

M 

26,401-27,066 

666 

221 

ACGAAC  AUG 

97.4/98.1 

97.2798.1 

97.2/98.1 

97.2/97.7 

98.7/99.5 

93.3/98.1 

96.3/98.6 

93.2/95.4 

93.9/96.8 

78.5/88.1 

ORF6 

27,077-27,268 

192 

63 

ACGAAC  AUG 

155 

97.3/95.2 

96.8/93.6 

97.3/95.2 

97.3/95.2 

97.3/96.8 

95.8/92.0 

94.2/92.0 

95.3/92 

94.7/90.4 

63.5/49.2 

ORF7a 

27,276-27,644 

369 

122 

ACGAACAUG 

94.5/95.9 

94.5/95.9 

94.5/95.9 

94.5/95.9 

97.8/100 

96.2/99.1 

92.9/95.0 

93.4/97.5 

93.2/97.5 

62.3/58.1 

ORF7b 

27,641-27,776 

135 

44 

96.2/93.1 

96.2793.1 

96.2/93.1 

96.2/93.1 

99.2/100 

99.2/100 

97.7/97.7 

99.2/100 

93.3/95.4 

62.9/63.6 

ORF8 

27,782-28,147 

366 

121 

ACGAACAUG 

47.1/46.3 

N/A 

N/A 

47.1/46.3 

97.8/100 

85.2/90.2 

46.2/39.0 

85.7/90.2 

85.7/85.3 

N/A 

N 

28,162-29,430 

1,269 

422 

ACGAAC  AUG 

8 

98.3/99.5 

98.4/99.5 

98.4/99.5 

98.4/99.5 

98/98.5 

96.6/97.6 

93.7/95.2 

96.2/97.1 

95.9/96.2 

77.9/87.2 

s2m 

29,628-29,668 

41 

97.5 

97.5 

97.5 

97.5 

100 

100 

100 

100 

100 

95.1 

*S1,  the  N-terminal  domain  of  the  coronavirus  S  protein  responsible  for  receptor  binding.  S2,  the  S  protein  C-terminal  domain  responsible  for  membrane  fusion. 

The  ORFs  in  the  genome  were  predicted  and  potential  protein  sequences  were  translated.  The  pairwise  comparisons  were  conducted  for  all  ORFs  at  nucleotide  acids  (nt)  and  amino  acids  (aa)  levels.  The  s2m  were  compared  at  nt 
level.  TRS:  Transcription  regulating-sequences;  N/D,  not  done;  N/A,  not  available. 
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Extended  Data  Table  5  |  Cell  lines  used  for  virus  isolation  and  susceptibility  tests 


Cell  lines 

Species  (organ)  origin 

Medium 

Infectivity 

293T 

Human  (kidney) 

DMEM+10%FBS 

- 

Hela 

Human  (cervix) 

- 

VeroE6 

Monkey  (kidney) 

+ 

PK15 

Pig  (kidney) 

+ 

BHK21 

Hamster  (kidney) 

- 

A549 

Human  (alveolar  basal  epithelial) 

+ 

BK 

Myotis  davidii  (kidney) 

RPM 1 1 640+ 1 0%FBS 

- 

RSKT 

Rhinolophus  sinicus  (kidney) 

DMEM/F1 2+1 0%FBS 

+ 

MCKT 

Myotis  chinensis  (kidney) 

- 

PaKi 

Pteropus  alecto  (kidney) 

- 

RLK 

Rousettus  leschenaulti  (kidney) 

- 

*  Infectivity  was  determined  by  the  presence  of  viral  antigen  detected  by  immunofluorescence  assay. 
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